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ABSTRACT Conspecific weeds retained characteristics from wild ancestors and also developed crop 
mimicries for adaptation and competitiveness. This research was conducted to identify quantitative trait loci 
(QTL) associated with the wild and crop-mimic traits and to determine haplotype variants for QTL-rich 
regions in U.S. weedy rice. An F 2 population from the cross between a cultivated (EM93-1) and a U.S. weedy 
(US1) rice line was evaluated for six wild and eight crop-mimic traits in a greenhouse to identify the QTL. A 
core collection of 27 U.S. weedy red rice lines and 14 AA-genome wild rice lines were determined for the 
haplotype variants. A total of 49 QTL were identified, with 45 collocated as clusters on 1 4 genomic segments. 
The number of haplotypes across the 14 segments was lower in the weedy (6.1 ± 2.4) than in the wild (7.5 ± 
1.8) rice sample. Both samples shared -50% haplotypes (wild-like). The EM93-1— like haplotypes accounted 
for a greater proportion (30 ± 26%) of the haplotypes in the weedy than in the wild (7 ± 10%) rice. Based on 
haplotype patterns for the 14 QTL cluster regions, 26 of the 28 red rice lines were clustered into two groups 
corresponding to the black-hull awned and straw-hull awnless morphological types, respectively. The QTL 
analysis demonstrated that conspecific weed-crop differentiation involved many genomic segments with 
multiple loci regulating natural variation for adaptation and competitiveness. The haplotype analysis revealed 
that U.S. weedy rice retained large blocks of linkage disequilibrium for the multiple loci from the wild relatives 
and also incorporated haplotypes from cultivars. 
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Weeds are unwanted plants that have adapted to human-disturbed 
environments and compete with crops for limited natural resources 
(Harlan 1965; Baker 1974; Booth et at 2003). Many crop species, such 
as barley (Hordeum vulgare), oat {Avena sativa), rice (Oryza sativa), 
sorghum (Sorghum bicolor), and wheat (Triticum aestivum), have 
conspecific or congeneric weeds (Ellstrand et al. 1999). Crops and 
their conspecific weeds share evolutionary origins and produce fertile 
hybrids resulting in gene flow from cultivars to the local weed pop- 
ulations. Conspecific weeds retained characteristics from wild relatives 
(e.g., seed shattering and dormancy) and also developed crop mim- 
icries (e.g., plant/seedling morphologies and rapid vegetative growth) 
to enhance the adaptation and competitiveness in agro-ecosystems. 
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Knowledge of specific genes regulating natural variation for the wild 
and crop mimic traits is essential to understand weed evolution and 
adaptation and to develop weed management and transgene mitiga- 
tion strategies. 

Weedy rice refers to various forms of unwanted plants that belong 
to the Oryza genus and are phenotypically intermediate between cul- 
tivated and wild (Oryza spp.) rice (Oka 1988; Delouche et al. 2007). 
Asian cultivated rice (O. sativa) originated from the wild ancestors 
(0. rufipogon and O. nivara) at multiple sites in South to Southeast Asia 
and differentiated into the indica and japonica subspecies, which are 
cultivated mainly in tropical and temperate areas, respectively (Kush 
and Brar 2002). Based on diagnostic characteristics for the subspecific 
and the wild-cultivated variation, weedy rice could be classified into 
indica- and japonica-\ike groups, with each consisting of wild-like 
and crop-mimic subgroups (Tang and Morishima 1997; Suh et al. 
1997). Red rice (i.e., red or brown pericarp-colored weedy rice) prevails 
in both indica- and japonica-Vikt weed populations. The indica-Yikz 
weed populations in Tropical Asia, where there was/is wild rice, could 
originate from natural hybridization between cultivated and wild rice 
or from natural variants of wild rice (Oka 1988). The japonicaAikt and 
some indica-\ike weed populations in the areas historically absent 
of wild rice (e.g., Europe and the United States) may originate from 
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old cultivars adaptive to the local environmental conditions or 
natural hybridization between indica and japonica cultivars (Tang 
and Morishima 1997; Suh et al. 1997; Ishikawa et al. 2005; Cao 
et al. 2006; Londo and Schaal 2007). U.S. weedy rice was likely derived 
from contaminants in imported seeds of commercial rice (Delouche 
et al. 2007) and has population structures closest to the indica and aus 
ecotypes of the indica subspecies (Reagon et al. 2010). 

Quantitative trait locus (QTL) analysis, map-based cloning, and 
gene-based haplotype analysis have been used to identify genes and 
their genomic organization or haplotype patterns for wild and crop 
mimic traits differentiated between weedy and cultivated rice. A total 
of 26 QTL were identified for 16 traits from a cross between & japonica 
cultivar and a japonica-Yike weedy rice line from France (Bres-Patry 
et al. 2001) and 20 QTL identified for five wild traits from a cross 
between an indica cultivar and a Thailand indica-Wke weedy rice line 
(Gu et al. 2005a; Ye et al. 2010). Many of the reported QTL were 
collocated on a small number of chromosomal segments. Some col- 
located QTL (weed adaptive haplotypes), such as qSD7-l/qPC7 or 
qSD4/qHC4 for seed dormancy and pericarp or hull color, could 
not be broken across several generations (Gu et al. 2005b). The 
qSD7-l/qPC7 QTL were map-based cloned as a single gene (Gu et al. 
2011), which is the red pericarp gene Rc encoding a bHLH family 
transcription factor (Sweeney et al. 2006; Furukawa et al. 2007). The 
functional Rc alleles in weedy red rice were clustered into two groups 
corresponding to the indica and japonica subspecies and the Rc alleles 
in U.S. red rice originated more likely from indica landraces than 
from wild rice (Reagon et al. 2010; Gu et al. 201 1). Haplotype analysis 
for the seed shattering genes qshl (Konishi et al. 2006) and sh4 (Li 
et al. 2006) suggested that these loci did not contribute to the trait 
variation in U.S. weedy rice populations (Thurber et al. 2010). Sim- 
ilarly, the haplotype diversity for the semi-dwarfl (sdl) region was not 
associated with the phenotypic variation for plant growth-related 
traits (Reagon et al. 2011). 

Despite the relatively short history of rice cultivation in the United 
States, weedy rice populations in the country have differentiated into 
many ecotypes and structures. Collected ecotypes are often classified 
based on seed morphologies (e.g., pericarp and hull colors, awn pres- 
ence/absence and grain types) and vary for multiple wild and crop 
mimic traits (Noldin et al. 1999; Delouche et al. 2007). The population 
structures were defined by genomic patterns for randomly selected 
short-DNA sequences (Reagon et al. 2010). There is still a considerable 
lack of knowledge on genes and genome structures underlying the 
trait variation and ecotype differentiation in weedy rice. Distinguish- 
ing from neutrally evolving DNA sequences, QTL for adaptive traits 
are targets of natural selections. Therefore, the objectives of this re- 
search were to identify QTL for wild and crop mimic traits, including 
those enhancing the competitiveness during the vegetative growth 
phase, and to determine haplotype patterns for the QTL-containing 
regions in U.S. weedy rice. 

MATERIALS AND METHODS 
Plant materials 

An F 2 population from the 'EM93-l7'USr cross was developed for 
QTL mapping. EM93-1 is a semi-dwarf line of indica-type cultivated 
rice and carries the mutant allele at sdl (Ye et al. 2013). US1 is a pure 
line of indica-like weedy rice from the United States and was used in 
the reported research (Tang and Morishima 1997, Suh et al. 1997). A 
core collection of 27 U.S. weedy rice lines and 14 lines of the AA- 
genome wild rice (O. rufipogon, O. nivara, and O. glumipatula), which 
were introduced from the National Small Grains Collection, the 



United States Department of Agriculture-Agriculture Research Ser- 
vice, were used to identify haplotype variants for mapped QTL 
regions. All the 28 (include the parent US1) weedy rice lines are red 
rice and represent four morphological types: black hull awned (BHA), 
furrow hull awned (FHA), straw hull awned (SHA), and straw hull 
awnless (SH). These weedy rice lines and some wild rice lines were 
planted in a greenhouse to confirm the seed morphologies (see Sup- 
porting Information, Table SI, Figure SI). 

Plant cultivation 

Fully after-ripened (dormancy-released) seeds from the Fj and paren- 
tal plants were germinated in a 30° incubator for 5 d and newly 
germinated seeds planted into 200-cell Plug trays filled with the rice 
nutrient solution (Yoshida et al. 1976) for 4 weeks to synchronize 
seedling size. About 500 F 2 seedlings were transplanted into pots 
(12x12x15 cm dimensions), with one plant per pot. The pots were 
filled with clay soil mixed with greenhouse medium (Sunshine Growth 
Mix) and placed in water-tight containers (60 cm xl20 cm) in a green- 
house. The greenhouse temperatures were set at 30/23° for day/night 
and the day length was natural, except for the period from 35 to 70 d 
after germination when a short-day (10 hr) treatment was applied to 
induce floral initiation. Flowering time was recorded as the date when 
the first panicle in a plant emerged from the leaf sheath. Seeds were 
harvested at 40 d after flowering, air-dried in the greenhouse for 3 d 
before stored in a freezer (-20°) to maintain the dormancy status. 

Phenotypic identification for wild and crop-mimic traits 

The 14 traits were measured with one or more parameters as 
described in Table 1. Methods used to measure the wild traits — seed 
shattering (SH), seed dormancy (SD), awn (AN), and hull color 
(HC) — were similar to those previously described (Gu et al. 2005a). 
Three reproductive tillers cut from a plant were gentiy shaken in 
a bucket for 10 sec to collect shattered seeds, and shattered and non- 
shattered seeds were counted to estimates a shattering rate for the 
plant. A sample of -30 seeds in a 9-cm Petri dish lined with a filter 
paper and wetted with 5 mL of water was germinated at 30° and 100% 
relative humidity in dark for 7 d. The mean germination (%) of three 
samples was used to estimate the degree of seed dormancy for a plant. 
AN was quantified by the mean awn length and percentage of seeds 
with awn for a sample of -50 seeds from a plant. HC was measured 
with the ChromaMeter Minolta CR310, which transfers reflectance 
spectra into the I*, a*, and b* readings to quantify blackness, redness, 
and yellowness, respectively. L* ranges from 0 to 100, with 0 and 100 
indicating completely nonreflective (black) and perfectly reflective 
(white), respectively, a* varies from —100 to 100, with negative and 
positive values indicating degrees of green and red pigmentations, 
respectively; a high, positive a* value indicates a high intensity of 
redness, b* varies from —100 to 100, with the negative and positive 
values indicating degrees of blue and yellow pigmentations, respec- 
tively; a high, positive b* value indicates a high intensity of yellow 
pigmentation. The pigment measurement was conducted using >100 
seeds in a 6-cm Petri dish on a dark background and repeated three 
times. 

The crop-mimic traits include plant height (PHR)- or tiller number 
(TNR)-increasing rates and seed setting percentage (SSP). Plant height 
and the number of tillers per plant were observed at 4, 6, and 8 wk 
after germination, and PHR and TNR were estimated as ratios of the 
observations between week 6 and week 4 or week 8 and week 6. SSP 
was evaluated as the proportion of the number of fertilized seeds to the 
total number of spikelets on the three panicles used to measure the 
shattering rate. 
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Table 1 List of wild and crop-mimic traits evaluated for the F 2 EM93-1/US1 population 



Traits (abbr.) 



Description 



Measurement 



Wild-like 

Leaf sheath color (LSC) 
Hull color (HC) 

Pericarp color (PC) 
Awn (AN) 
Shattering (SH) 
Seed dormancy (SD) 
Crop mimicry 

Plant height-increasing rate (PHR) 

Tiller number-increasing rate (TNR) 

Flowering time (FT) 

Plant height (PH) 

Reproductive tiller (RTN) 

Seed number (SN) 

Seed setting percentage (SSP) 

Seed weight (SW) 



Pigments on the leaf sheath 

Pigments on the hull (lemma and palea) 

Pigments on the pericarp tissue 
Long stick appendage with the lemma 
Seeds shattered during maturation 
Delayed germination 

Rate of increase in plant height 

Rate of increase in number of tillers 

Time period required for a plant to flower 

Height of a matured plant 

Tillers with seeded panicles 

Fertile florets in a plant. 

Percentage of filled seeds for a plant 

Dry weight of seeds from a plant 



Visualized as purple or green 

Visualized as black and straw and also quantified 

with reflectance spectra (L* a*, and b*) 
Visualized as red, brown, or white 
Average AN length and percent seeds with AN 
Percentage seeds shattered 
Germination % of air-dried seeds 

Differences in plant height between periods 

of 4, 6, and 8 wk after germination 
Differences in no. of tillers between periods 

of 4, 6, and 8 wk after germination 
Days to the emergence of the 1st panicle from 

the leaf sheath 
Length of the main stem from the base to the 

top of the panicle 
No. of reproductive tillers in a plant 
The total number of seeds 
Percentage of fertile/total florets averaged over 

three main panicles 
Weight (g) of 100 air-dried seeds 



Marker genotyping and linkage map construction 

Genomic DNA was extracted from fresh leaf segments of the parental 
and F 2 seedlings. Approximately 320 rice microsatellite markers 
(McCouch et al. 2002) were screened for polymorphism between the 
parents. Polymorphic markers were used to genotype a subpopulation 
of 188 F 2 plants to develop a framework linkage map to scan for QTL. 
The remaining F 2 s were genotyped only with the markers nearest to 
QTL peak positions to confirm their effects. DNA extraction, marker 
amplification by polymerase chain reaction, and marker display by 
electrophoresis with nondenatured polyacrylamide gel were performed 
using the previously described methods (Gu et al. 2004a). The linkage 
map was constructed using MAPMAKER/EXP 3.0 (Lincoln et al. 
1992). The marker genotyping data were also analyzed for segregation 
distortion by a chi-square test. 

QTL mapping 

The initial QTL analysis was conducted based on the subpopulation. 
The composite interval mapping program of the WinQTLCart 
software (Wang et al. 2006) was used to generate likelihood ratio 
distributions for individual traits to infer QTL positions and to esti- 
mate QTL additive (a) and dominance (d) effects. The QTL threshold 
was established by 1000 random permutations at a genome-wide type 
I error of 5%. The degree of dominance for a QTL was estimated by 
the d/a ratio. A QTL confidence interval was expressed by the length 
of 1-LOD (or 4.61 likelihood ratio) support region in centimorgan 
derived from Kosambi's map function. Two or more QTL with over- 
lapping 1-LOD support intervals were defined as a QTL cluster. 

The QTL detected by composite interval mapping in the sub- 
population were corroborated by single marker analysis (SMA) for 
data from the full population of -480 F 2 plants. The SMA was con- 
ducted using one-way analysis of variance with the marker nearest to 
a QTL peak as the indicator variable. The variables were coded as 0, 1, 
and 2 for the EM93-l-like homozygote, heterozygote, and USl-like 
homozygote, respectively, for the analysis. Analysis of variance was 
performed using the SAS GLM procedure (SAS Institute 2011). 



Haplotype analysis 

Haplotype analysis was conducted for the QTL cluster regions 
detected from the mapping population. The analysis was limited to 
the weedy and wild rice lines (Table SI) and did not include cultivars 
because some wild characteristics in weeds, such as a high degree of 
seed shattering and dormancy, black hull color, and long awn (Figure 
SI), are absent or very rare in cultivars. The 27 weedy and 14 wild rice 
lines were genotyped with all markers mapped on each of the genomic 
regions that encompass 1-LOD support intervals of the clustered QTL. 
The marker genotypes (nonallelic combinations) were used to deter- 
mine haplotype variants for individual QTL cluster regions. The phys- 
ical length of a haplotype was estimated based on marker positions on 
the Nipponbare genome sequence (Gramene 2012). The weedy and 
wild rice samples were compared for the number of haplotypes using 
a paired Student's r-test. The marker genotyping data were subjected 
to a cluster analysis using the single linkage method (SAS Institute 
2011) to group the U.S. weedy red rice lines and to infer phylogenetic 
relations among the weedy and wild rice lines. 

RESULTS 

Phenotypic variation and correlation 

The parental lines US1 and EM93-1 were divergent in phenotype for 
the six wild traits, which varied in segregation pattern in the F 2 pop- 
ulation. Both leaf sheath (LSC) and pericarp (PC) colors were scored 
as qualitative traits; the F 2 plants could be grouped into the two 
(purple and green) types for LSC or three (red, brown, and white) 
types for PC (Figure 1, A and B). The purple or brown type was USl- 
like, whereas the green or white type was EM93-l-like for LSC or PC. 
The red type of the F 2 s resembled the hybrid FxS for PC, presumably 
resulting from the complementation of non-allelic genes assembled 
from the two parents. Both SH and SD are quantitative traits and their 
F 2 frequency distributions approximated to normal distributions (Fig- 
ure 1, C and D). The HC and AN traits appeared to be of both 
qualitative and quantitative natures. For HC, the F 2 population con- 
sisted of black, brown, furrow, golden, and straw hull— colored plants 
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Figure 1 Frequency distributions of wild (A-F) and crop-mimic (G-L) traits in the F2 EM93-1/US1 population. Arrows indicate the parental means. 



and some plants had the hull tissue with multiple colors. However, 
using the component reflectance spectrum for blackness (L*) or yel- 
lowness (b*), the F 2 s could be divided into basically two groups, 
although variation occurred within each group (Figure IE). For AN, 
the F 2 population consisted of -86% awned and -14% awnless plants. 
However, the awned plants varied in awn length from -0.5 to 5 cm or 
percentage of awned seeds from -2 to 100%, and there was a nonlinear 
correlation between these two quantitative measurements (Figure IF). 

The parent USl had later flowering time (FT), less reproductive 
tillers/plant (RTN), and taller plant height (PH) than the parent 
EM93-1 and was similar to EM93-1 in seed numbers/plant (SN), seed 
weight (SW), and seed setting percentage (SSP). In the F 2 population, 
FT, PH, RTN, SN, and SW displayed approximately normal frequency 
distributions and transgressive segregation (Figure 1, G— K), and only 
SSP had a negatively skewed frequency distribution (Figure 1L). 

Phenotypic variation for the plant vegetative growth-related traits 
(i.e., plant height and tiller number) was observed between the two 
parental lines and among the F 2 individuals. The range of the ob- 
served variations increased with times from week 4 to week 8 in the F 2 
population (Figure 2, A and D). Both PHR and TNR exhibited ap- 
proximately normal frequency distributions for each of the two (weeks 
4—6 and weeks 6—8) periods (Figure 2, B and C, E and F). 

Each of the 14 traits measured was correlated with three (HC) to 
10 (AN) of the remaining 13 traits. The correlation occurred between 
wild traits, crop-mimic traits, and wild and crop-mimic traits (Table 
2). For example, SD was correlated with the other five wild traits and 
also with the two crop-mimic traits FT and SW; and the two vegeta- 
tive growth-related traits PHR and TNR were negatively correlated 
(r = -0.21). The phenotypic correlations suggest that none of the 
adaptive traits was inherited independently. 



The framework linkage map 

Of the 320 markers screened, 173 are polymorphic between EM93-1 
and USl, with the polymorphic rate being 54%. The subpopulation of 
188 F 2 plants was genotyped with 123 polymorphic markers and the 
genotyping data (no missing data) were used to construct a framework 
linkage map (Figure 3). The total length of the map was 1423 cM, with 
the average inter-marker distance being 13 (±7) cM. Segregation 
distortion was detected for markers on seven segments of chromo- 
somes 1 and 7 to 12. Four and three of the seven segments had allelic 
frequencies in favor of EM93-1- and USl-derived alleles, respectively 
(Figure 3, see Table S2). 

QTL associated with wild and crop-mimic traits 

A total of 21 QTL were associated with the six wild traits in the 
subpopulation, including one for LSC, three for PC, three for HC, six 
for AN, three for SH, and five for SD (Table 3, see Figure S2, A— F). 
The number of QTL for a trait varied with the measurements. For 
example, only one QTL for PC (qPC7, R 2 = 0.59) or HC (qHC4, R 2 = 
0.69) was detected when plants were scored as the presence (1) or 
absence (0) of the pigments. qPC7 was confirmed and additional two 
loci (.R 2 < 0.09) were detected for PC when it was scored as red (2), 
brown (1), and white (0). Similarly, qHC4 was confirmed and addi- 
tional two HC loci (R 2 <0.09) were detected when the trait was 
quantified with the reflectance spectra (L*, a*, and b* values). Like- 
wise, three awn loci (i.e., qAN4, 6.1, and 8) were associated with both 
awn length and percentage of awned seeds and the remaining three 
QTL (i.e., qAN2, 3, and 6.2) associated with only one of the two 
measurements. The parent EM93-1 contributed alleles increasing seed 
dormancy for qSD1.2, black pigment for qHC8, red pigment for qPC6, 
and awn length or awned-seed for qAN6.1, whereas the parent USl 
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Figure 2 Dynamic patterns of plant vegeta- 
tive growth in the F2 EM93-1/US1 population. 
(A, D) Plant height and the number of tillers 
per plant at weeks 4, 6, and 8. (B— C, E— F) 
Frequency distributions of the plant height- 
and tiller number-increasing rates for week 
6/week 4 (w6/w4) or w8/w6. Arrows indicate 
the parental means. 
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contributed the effect-increasing alleles to the remaining 17 QTL. 
Approximately one half of the 21 QTL had a strong dominance effect 
(d) on PC (qPC6 and 7), HC (qHCl, 4 and 8), AN (qAN6.2), SH 
(qSH7), or SD (qSD4 and 7.2), as suggested by the dla ratios of SI 
or < -1 (Table 3). 

A total of 15 QTL were associated with the six crop-mimic traits, 
including three for FT, one for RTN, four for PH, two for SN, two for 
SW, and three for SSP, in the subpopulation (Table 4, see Figure S2, 
G-K). The QTL qPHl (R 2 = 0.38) and qFT7 {R 2 = 0.13) had a rela- 
tively large effect on PH or FT, and the remaining 13 loci contributed 
<9% to the total phenotypic variances. The US1- and EM93- 
1 — derived alleles increased additive effects for 12 and three of the 
15 QTL, respectively. Of the 15 QTL, six displayed a strong domi- 
nance effect on FT, RTN, SW, or SSP (Table 4). 

Six QTL (qPHv) were associated with plant height at weeks 4, 6, or 
8 of the vegetative growth phase (Table 5, see Figure S2, L and M). 



The qPHvl and qPHv8 QTL were mapped to the same positions as 
qPHl and qPHS, respectively (refer to peak positions and nearest 
markers in Table 4 and Table 5), suggesting that qPHvl and qPHl 
and qPHv8 and qPH8 are underlain by same genes expressed in both 
of the vegetative and reproductive growth phases. The remaining four 
qPHv QTL appeared to express only in given period(s) of the vege- 
tative growth phase. Four QTL were associated with plant height- 
increasing rate (PHR) and three (qPHRl, 2, and 6. J) of the four collocated 
with the qPHv loci (qPHvl, 2, and 6). The other PHR QTL {qPHR6.2) 
appeared to express only from weeks 4 to 6. US1 and EM93-1 con- 
tributed the effect-increasing alleles to the qPHRl and 6.1 and the 
qPHR2 and 6.2 loci, respectively. 

One and two QTL were associated with tiller numbers (qTNvl) 
and tiller number-increasing rate (qTNRl and 5), respectively, during 
the vegetative growth phase (Table 5, see Figure S2N). Both qTNvl and 
qTNRl were same for the peak position and had the effect-increasing 



Table 2 Summary of correlation coefficients (r) between adaptive traits segregating in the F2 EM93-1/US1 population 



Trait" 








Wild Trait b 












Crop-Mimic Trait 








SH 


SD 


PC 


HC 


AN 


LSC 


PH 


FT 


RTN 


SN 


SW 


SSP 


PHR 


TNR 


Wild-like 


SH 




<0.002 


0.035 


0.195 


<0.001 


0.914 


0.041 


<0.011 


<0.009 


<0.001 


0.035 


<0.001 


0.178 


0.731 




SD 


0.14 




<0.001 


<0.001 


<0.001 


<0.003 


0.447 


0.023 


0.329 


0.114 


0.035 


0.176 


0.141 


0.997 




PC 


0.10 


0.19 




0.209 


<0.008 


0.190 


<0.001 


<0.001 


<0.001 


<0.007 


0.481 


0.022 


0.676 


0.606 




HC 


-0.06 


-0.20 


-0.04 




<0.001 


0.023 


0.073 


0.298 


0.547 


0.063 


0.749 


0.954 


0.844 


0.762 




AN 


0.18 


0.22 


0.12 


-0.23 




0.222 


<0.001 


<0.001 


<0.005 


<0.001 


<0.001 


<0.001 


0.936 


0.798 




LSC 


0.00 


0.13 


0.06 


-0.14 


-0.06 




0.082 


<0.001 


0.963 


0.103 


0.035 


0.126 


<0.001 


<0.001 


Crop mimic 


PH 


0.09 


-0.03 


0.19 


-0.08 


0.23 


0.08 




0.451 


<0.001 


<0.001 


0.055 


<0.001 


<0.001 


<0.005 


FT 


0.12 


-0.10 


0.43 


-0.05 


0.20 


-0.37 


-0.03 




<0.001 


0.942 


<0.001 


0.971 


<0.001 


0.220 




RTN 


-0.12 


0.04 


-0.20 


0.03 


-0.13 


0.00 


-0.42 


-0.22 




0.609 


<0.001 


0.249 


<0.001 


<0.001 




SN 


0.21 


0.07 


0.12 


-0.01 


0.28 


0.07 


0.28 


0.00 


-0.02 




0.019 


<0.001 


0.047 


0.731 




SW 


0.10 


0.10 


0.03 


0.08 


0.40 


-0.10 


0.08 


0.16 


-0.17 


0.12 




<0.001 


0.657 


0.541 




SSP 


0.19 


0.06 


0.10 


-0.06 


0.27 


0.07 


0.15 


0.00 


-0.05 


0.56 


0.19 




0.153 


0.366 




PHR 


0.06 


0.07 


0.02 


-0.01 


0.00 


0.23 


0.26 


-0.20 


-0.15 


0.09 


0.02 


0.06 




<0.001 




TNR 


0.02 


0.00 


-0.02 


0.01 


-0.01 


-0.13 


-0.30 


0.06 


0.26 


-0.02 


-0.03 


-0.04 


-0.21 




Pair c 




9 


7 


8 


3 


10 


6 


8 


8 


8 


7 


8 


6 


6 


4 



SH, seed shattering (shattering rate); SD, seed dormancy (% germination); PC, pericarp color (3-color measurement); HC, hull color (L*); AN, awn (awn length); LSC, leaf 
sheath color; PH, plant height; FT, flowering time; RTN, number of reproductive tillers/plant; SN, no. of seed/plant; SW, seed weight; SSP, seed setting percentage; 
PHR, the week6/week4 plant height ratio; and TNR, the week6/week4 tiller number ratio, 
k Traits were evaluated for 480-493 F 2 plants. 

Listed below and above the diagonal line are rvalues and their probability (P) levels, respectively. Values significant at P < 0.05 are shown in bold. 

Number of trait pairs with a significant correlation in the column. 
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Figure 3 Framework linkage map and QTL distribution. The map was constructed based on 188 plants from the F2 population. Vertical bars 
represent 12 chromosomes marked with rice microsatellite loci. Map distance in centiMorgan was derived from Kosambi's map function. 
Chromosomal segments with markers displaying a segregation distortion are depicted by gray-colored or striped bars, which indicate the 
distortion in favor of the alleles from the parental line EM93-1 or US1. The marker in bold indicates its segregation ratio deviated most from 
the Mendelian expectation (Table S2). The lengths of black bars right to each chromosome indicate 1 -LOD support regions for the named QTL 
associated with wild and crop mimic traits. The empty triangles point to the peak position of the QTL in the chromosome. QTL with overlapping 
1-LOD support regions are grouped as a QTL cluster (CL). Underlined markers were used to determine haplotype patterns for the QTL cluster 
regions in weedy and wild rice lines (Table S1). Known genes qsh7, sd7, sh4, Rc, and SD7-1 were positioned on the map according to their 
physical positions on the rice reference genome sequence (Gramene 2012). 



allele from EM93-1. qTNR5 (R 2 = 0.04) contributed less to the phe- 
notypic variance than qTNRl (R 2 = 0.14) and had the effect-increasing 
allele from US1. 

The aforementioned 49 QTL detected in the subpopulation were 
all confirmed with the full population, as shown by significant levels 
(P values) from the SMA for the nearest markers (Table 3, Table 4, 
and Table 5). In general, SMA underestimated the QTL effects (as 
shown by R 2 values), mainly because of distances between the marker 
and QTL peak positions. These 49 QTL were located on 10 chromo- 
somes (1 to 9 and 12), with 45 (92%) of them collocated as clusters on 
14 chromosomal regions (Figure 3). Of the 14 QTL clusters (CL), one 
encompasses the QTL for crop-mimic (CL5), two contain the QTL for 
wild traits (CL4.2 and CL7.2), and the remaining 11 have QTL for 
both wild and crop mimic traits. 

Haplotype variants for QTL cluster regions 
in U.S. weedy rice 

The physical length of haplotypes for the 14 QTL clusters varies from 
3.2 to 12.0 Mb. The number of haplotypes varied with the clusters 
from two to 10 in the sample of 28 U.S. weedy rice lines and from four 
to 10 in the sample of 14 wild rice lines (Table 6). The correlation 
between the lengths and haplotype numbers was not significant in the 
weedy (r = 0.38, P = 0.18) or wild (r = 0.03, P = 0.94) rice samples, 
suggesting that the physical size was not a major contributor to the 
haplotypic variation. On average, the number of haplotypes was sig- 
nificantly smaller (f = -3.4, P = 0.002) in the weedy (6.1 ± 2.4) than in 
the wild (7.5 ± 1.8) rice sample, indicating that U.S. weedy rice is less 
diverse than wild rice for the QTL-rich regions. 

Shared haplotypes between the weedy and wild rice samples 
ranged from 10 to 100%, with the average being 52%, across the 14 
cluster regions (Table 6). The EM93-1— like haplotypes were present 



in 10 of the 14 QTL regions and accounted for 30% of the haplotypes 
on average in the weedy rice sample. In contrast, the EM93-1— like 
haplotypes were present only in six of the 14 regions and represent 7% 
of the haplotypes in the wild rice sample. Of the four QTL cluster 
regions (CL1.1, 1.2, 5, and 7.1) absent of the EM93-l-like haplotypes, 
CL1.1 and CL1.2 link on an interval of -40 cM containing sdl and 
CL7.1 contains Rc (Figure 3). The absence of the EM93-1— like hap- 
lotypes indicates that U.S. red rice populations have maintained rela- 
tively large blocks of linkage disequilibrium (LD) to cover the wild- type 
alleles Sdl and Rc since the founder genotypes were introduced into the 
new continent. The US 1 -like haplotypes were present in all the 14 QTL 
regions and account for 22% of the haplotypes in the sample of U.S. 
weedy rice. In contrast, the US 1 -like haplotypes were present in six of 
the 14 QTL regions and represent 11% of the haplotypes in the wild 
rice sample. 

Based on the haplotype data for the 14 QTL cluster regions, the 28 
weedy rice lines were separated from most (9/14) of the wild rice lines 
and 26 of the 28 clustered into two groups (Figure 4). Group I 
includes 10 (nine BHA and one FHA) weedy and 3 BHA wild rice 
lines, while Group II consists of 16 (15 SH and 1 FHA) weedy rice. 
The other two red rice lines, R01 (PI-506229, SHA) from California 
and R09 (PI-653420, FHA) from Louisiana, appeared to be close to 
FHA or SHA wild rice. 

DISCUSSION 

Trait correlations and QTL clusters 

This research confirmed the previous observation that wild traits are 
interrelated phenotypically (Gu et al. 2005c) and also revealed that the 
phenotypic correlation is common between wild and crop-mimic 
traits in weedy rice. For example, the characters seed shattering, awn 
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■ Table 3 List of QTL associated with wild traits in the F 2 EM93-1/US1 population 



QTL a 


Ch. 






CIM (N = 


188) b 






SMA (N > 480) c 




Peak 


LR 


3 




d/a 


R 2 


Marker 


P 


R 2 


Leaf sheath color 






















qLSC6 


6 


35 


102 


0.8 


0.5 


0.6 


0.42 


RM204 


<0.0001 


0.31 


Pericarp color 






















qPC6 (red/brown/white) 


6 


122 


13 


-0.1 


0.3 


-3.0 


0.03 


RM5314 


0.0001 


0.01 


qPC7 (present/absent) 


7 


39 


236 


0.2 


0.5 


2.5 


0.59 


RM6018 


<0.0001 


0.76 


(red/brown/white) 




35 


368 


0.9 


0.9 


1.0 


0.84 


RID12/Rc 


<0.0001 


0.83 


qPC12 (red/brown/white) 


12 


39 


17 


0.3 


-0.1 


-0.3 


0.04 


RM7003 


0.0001 


0.02 


Hull color 






















qHC7 (blackness L*) 


1 


101 


19 


-0.74 


-7.4 


10.0 


0.08 


RM5972 


0.0006 


0.03 


qHC4 (visual score) 


4 


80 


224 


0.9 


0.7 


0.8 


0.69 


RM252 


<0.0001 


0.59 


(blackness /_*) 




79 


38 


-9.1 


-10.9 


1.2 


0.31 


RM252 


<0.0001 


0.52 


(redness a*) 




79 


75 


-1.2 


-1.4 


1.2 


0.39 


RM252 


<0.0001 


0.48 


(yellowness fa*) 




79 


190 


-7.4 


-7.9 


1.1 


0.57 


RM252 


<0.0001 


0.51 


qHC8 (blackness-L*) 


8 


33 


50 


21.5 


20.8 


1.0 


0.01 


RM3778 


0.0038 


0.02 


Awn 






















qAN2 [% awned seeds) 


2 


108 


12 


9.2 


5.5 


0.6 


0.02 


RM5363 


0.0101 


0.02 


qAN3 (awn length) 


3 


25 


22 


0.7 


-0.4 


-0.6 


0.11 


RM545 


0.0001 


0.04 


qAN4 (awn length) 


4 


50 


57 


1.0 


0.1 


0.1 


0.24 


RM 16757 


<0.0001 


0.11 


(% awned seeds) 




51 


77 


31.4 


16.9 


0.5 


0.17 


RM16757 


<0.0001 


0.09 


qAN6.1 (awn length) 


6 


40 


24 


-0.6 


0.2 


-0.3 


0.12 


RM204 


0.0011 


0.03 


(% awned seeds) 




40 


19 


-14.2 


11.3 


-0.8 


0.12 


RM204 


0.0003 


0.04 


qAN6.2 (awn length) 


6 


98 


25 


0.5 


-0.5 


-1.0 


0.14 


RM5314 


<0.0001 


0.06 


qAN8 (% awned seeds) 


8 


82 


39 


26.9 


11.5 


0.4 


0.10 


RM6765 


<0.0001 


0.11 


(awn length) 




81 


28 


0.7 


-0.1 


-0.1 


0.11 


RM6765 


<0.0001 


0.10 


Seed shattering (%) 






















qSH3 


3 


97 


33 


14.6 


10.7 


0.7 


0.09 


RM1334 


<0.0001 


0.08 


qSH7 


7 


72 


15 


1.0 


11.9 


11.9 


0.01 


RM6403 


0.0118 


0.02 


qSH9 


9 


12 


12 


8.5 


-2.0 


-0.2 


0.07 


RM5515 


0.0102 


0.02 


Seed dormancy (% germination) 






















qSDT.2 


1 


136 


12 


8.4 


2.5 


0.3 


0.03 


RM472 


0.0099 


0.02 


qSD4 


4 


81 


26 


-11.0 


11.2 


-1.0 


0.17 


RM252 


0.0051 


0.02 


qSD6 


6 


38 


18 


-11.0 


7.7 


-0.7 


0.13 


RM204 


0.0302 


0.01 


qSD7. 1 


7 


34 


12 


-5.2 


-3.9 


0.8 


0.02 


RID12 


0.0004 


0.03 


qSD7.2 


7 


86 


17 


-4.5 


24.1 


-5.4 


0.12 


RM6403 


0.0096 


0.02 



QTL, quantitative trait loci; Ch, chromosome; CIM, composite interval mapping; SMA, single marker analysis; ANOVA, analysis of variance. 
, Measurements in parenthesis are used to detect the QTL. 

Likelihood ratio (LR) at the peak (cM) position, additive (a) and dominance (d) effects, and proportion of the phenotypic variance explained by the locus (P 2 ) were 

computed by the CIM program based on a subpopulation of 188 (N) F 2 plants. 

The marker nearest the peak was used to confirm the QTL with the whole population of 480-493 (N) F 2 plants by SMA. The F-test probability level (P) and P 2 were 
estimated by one-way ANOVA. 



presence, red/brown pericarp color, purple leaf sheath color, and seed 
dormancy were positively or negatively correlated with several of the 
eight crop mimic traits in the large F 2 population (Table 2). Natural 
selection tended to assemble the wild characters, also known as "adap- 
tive syndromes," to make a weed plant weedy (Harlan 1965; Oka 
1988). The natural selection in agro-ecosystems is expected, to some 
extent, to impact weed phenotypes for multiple crop-mimic traits, 
including plant height- and tiller number-increasing rates that contrib- 
ute to early plant competitiveness. The plant height and tiller number 
traits differed in dynamic pattern during the vegetative growth phase 
(Figure 2, A and D) and were correlated with the final plant height and 
number of reproductive tillers (Table 2). However, the correlations 
accounted for only <10% of the phenotypic variances, suggesting that 
there are additional genes imparting the early plant competitiveness. 
QTL analysis in the previous and this research provided valuable in- 
sight into genetic basis and genomic structures underlying individual 
wild and crop-mimic traits and trait correlations. 

More QTL were detected for wild and crop-mimic traits in this 
than in the reported research on weedy rice (Bres-Patry et al. 2001; Gu 



et al 2005a; Jing et al. 2008; Subudhi et al. 2012; Thurber et al. 2013). 
In addition to the difference in the parental combination of mapping 
population from the previous research, a larger population size, the 
greenhouse condition, and more trait measurements used in this re- 
search must have also contributed to the QTL detection. It was un- 
foreseen that greater than 90% of the QTL collocated as clusters on 14 
genomic regions. The QTL distribution patterns on the genome explain 
the observed trait correlations. In addition, more QTL clusters detected 
in this research indicate that "a small number of chromosomal seg- 
ments" (Bres-Patry et al. 2001) is insufficient to explain the conspecific 
weed-crop differentiation for adaptive or domestication-related traits. 

Both correlated and uncorrelated traits may have their QTL 
collocated on the same genomic segments. It was estimated that 
correlated traits had more collocated QTL or shared QTL clusters 
(33% on average) than uncorrelated traits (11%) (Gardner and Latta 
2007). Of the 49 pairs of trait correlation listed in Table 2, 22 (45%) 
had one or more shared QTL clusters, which is higher than the pre- 
vious estimation. On the other hand, of the 70 QTL pairs in the 14 
clusters, 45 (64%) displayed trait correlations (29 positive and 16 
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Table 4 List of QTLs associated with crop-mimic traits expressed during the reproductive growth phase in the F 2 EM93-1/US1 
population 



QTL a 


Ch. 






CIM (N = 


188) fa CIM 






SMA (N > 480) c 




Peak 


LK 


a 


U 


U/3 


K 


Marker 


P 


f\ 


Flowering time, d 






















qFT1 


1 


143 


12 


-2.1 


-1.0 


0.5 


0.02 


RM11988 


0.0049 


0.02 


qFT6 


6 


29 


21 


-2.8 


-2.7 


1.0 


0.02 


RM204 


<0.0001 


0.06 


qFT7 


7 


41 


99 


5.8 


3.3 


0.6 


0.13 


RM6018 


<0.0001 


0.08 


Number of reproductive tillers per plant 






















qRTN5 


5 


31 


17 


0.2 


-1.7 


-8.5 


0.07 


RM169 


<0.0001 


0.06 


Plant height, cm 






















qPHl 


1 


142 


255 


22.3 


9.2 


0.4 


0.38 


RM472 


<0.0001 


0.17 


qPH5 


5 


105 


26 


6.0 


-0.1 


-0.02 


0.05 


RM7081 


<0.0001 


0.04 


qPH6 


6 


100 


20 


5.4 


0.1 


0.02 


0.03 


RM3827 


0.0044 


0.03 


qPH8 


8 


95 


20 


5.1 


-2.6 


-0.5 


0.05 


RM6765 


0.0001 


0.04 


Number of seeds per plant 






















qSN1 


1 


143 


31 


98.3 


81.5 


0.8 


0.02 


RM 11988 


<0.0001 


0.05 


qSN9 


9 


5 


13 


65.1 


3.0 


0.05 


0.05 


RM5515 


0.0091 


0.02 


100-seed weight, g 






















qSW1 


1 


101 


22 


-0.1 


-0.4 


4.0 


0.01 


RM5972 


0.0257 


0.01 


qSW8 


8 


33 


248 


1.2 


1.2 


1.0 


0.03 


RM3778 


0.0038 


0.02 


Seed setting percentage 






















qSSPI 


1 


87 


12 


0.1 


-0.2 


-2.0 


0.07 


RM6073 


0.0274 


0.01 


qSSP3 


3 


30 


12 


0.2 


-0.1 


-0.5 


0.06 


RM545 


0.0028 


0.02 


qSSP4 


4 


43 


12 


0.2 


-0.2 


-1.0 


0.08 


RM 16757 


<0.0001 


0.05 



QTL, quantitative trait loci; Ch, chromosome; CIM, composite interval mapping; SMA, single marker analysis; ANOVA, analysis of variance. 
. Measurements in parenthesis are used to detect the QTL. 

Likelihood ratio (LR) at the peak (cM) position, additive (a) and dominance (d) effects, and proportion of the phenotypic variance explained by the locus (R 2 ) were 

computed by the CIM program based on a subpopulation of 188 (N) F 2 plants. 

The marker nearest the peak was used to confirm the QTL with the whole population of 480-493 (N) F 2 plants by SMA. The F-test probability level (P) and R 2 were 
estimated by one-way ANOVA. 



negative), and the remaining 21 (36%) did not correlate phenotypi- 
cally in the mapping population (see Table S3). For example, qSD1.2 
was clustered with qSNl, qPHl, qPHRl, and qTNRl in cluster CL1.2, 
although SD was not correlated with any of the SN, PH, PHR, and 
TNR traits. A higher level of the correspondence between trait corre- 
lations and QTL clusters was observed for the 5 wild traits (SH, SD, 
AN, HC, and PC) in our previous research, most likely because their 
heritability levels were relatively high (Gu et al 2005a). Genetic back- 
ground effects also interfere with the expression of trait correlation or 
clustered QTL. For example, the correlation between seed dormancy 
and plant height and their QTL clusters qSD1.2/qPHl and qSD7.2/qPH7 
were detected only in advanced backcross populations (Ye et al. 2013). 
Additional data can be found at Table S4, Table S5, and Table S6. 

There is evidence that trait correlations in weedy rice could arise 
from pleiotropic effects of single genes, or the linkage between genes 
for the correlated traits, or both pleiotropy and linkage, although these 
two mechanisms are often confounded at the QTL level. Cluster CL7.1 
consists of qPC7 for pericarp color, qSD7.1 for seed dormancy, and 
qFT7 for flowering time. Both qPC7 and qSD7.1 are underlain by Rc 
and share the same functional nucleotide polymorphism or FNP (Gu 
et al, 2011), whereas qFT7 has been finely mapped to the position 
a few centimorgan (~6 Mb) from Rc and genetically isolated from 
qSD7.1 (Gu and Foley 2007 and unpublished data). Cluster CL1.2 
encompasses eight QTL for plant height, seed dormancy, and other 
growth- related traits and also the known gene sdl encoding gibber- 
ellin 20-oxidase. Fine mapping, gene expression analysis, and gibber- 
ellin induction analysis indicated that weedy rice carries the wild allele 
at sdl to promote seedling/plant growth and also suggested that the 
mutant allele sdl from the parent EM93-1 most likely has a pleiotropic 
effect on enhanced seed dormancy (Ye et al. 2013). Recent research 



identified additional FNPs at sdl (Asano et al. 2011) and new plant 
height QTL (Kovi et al. 201 1) in the CL1.2 region, suggesting that a QTL 
cluster could be underlain by multiple molecular genetic mechanisms. 

Implications of haplotypic variants 
for the multi-QTL regions 

The haplotype analysis revealed that U.S. weedy red rice populations 
are highly differentiated in genomic regions containing multiple QTL 
responsible for adaptation and competitiveness. The haplotype di- 
versity suggests that the wild and crop-mimic traits segregating in the 
mapping population may also differentiate among ecotypes or 
populations of weedy rice. About 50% of the haplotypes in the 28 
weedy rice lines were also present in the 14 lines of the AA-genome 
wild rice (O. spp.). These shared wild-like haplotypes were presumably 
derived from introduced founder genotypes, as there was no wild rice 
in North America, and have been maintained by LD and natural 
selection during the evolution of red rice populations. Some of the 
LD blocks are > 10 Mb in length (e.g., CL2 and 4.1) or 100% wild-like 
(e.g., CL1.1, 6.1, and 6.2). The parental line EM93-1 was developed by 
a hybridization of two semi-dwarf indica cultivars from China (Gu 
et al. 2004b), which have no direct pedigree relationship with the 
U.S. weedy rice lines. The presence of EM93-1— like haplotypes in 
the weedy rice sample could not be a random event, for the fre- 
quency (30%) is much higher than that (7%) in the wild rice sample. 
Therefore, the haplotypes other than the wild-like ones in U.S. red 
rice populations were most likely incorporated from old cultivars. 
The haplotypes like the weedy rice parent US1 accounted for ~20% 
of the haplotypes in the 28 red rice lines. This is indicative that the 
current U.S. weedy rice populations may originate from multiple 
founder genotypes. Further research could extend the haplotype 



1056 I M. S. Mispan et al. 



"~5tG3'Genes I Genomes | Genetics 



Table 5 List of QTL associated with crop-mimic traits expressed during the vegetative growth phase in the F 2 EM93-1/US1 population 



QTL a 


Ch. 






CIM (N = 1f 


38) b CIM 






SMA (N > 480) c 




reaK 


I P 

Lr\ 


a 


O 


O/a 


f\ 


Marker 


P 


A 


Tiller number 






















qTNv^ (w6) 


1 


139 


56 


-0.5 


-0.2 


0.4 


0.14 


RM472 


<0.0001 


0.04 


(w8) 




139 


53 


-1.7 


-0.7 


0.4 


0.14 


RM472 


<0.0001 


0.04 


Tiller number-increasing rate, % 






















qTNRI (w6/w4) 


1 


139 


47 


-52 


-22 


0.4 


0.14 


RM472 


<0.0001 


0.04 


qTNR5 (w8/w6) 


5 


33 


12 


10 


4.7 


0.5 


0.04 


RM169 


0.0071 


0.01 


Plant height, cm 






















qPHvl (w4) 


1 


142 


99 


3.0 


-0.5 


-0.2 


0.43 


RM472 


<0.0001 


0.12 


(w6) 




142 


106 


6.5 


-0.5 


-0.1 


0.38 


RM472 


<0.0001 


0.14 


(w8) 




140 


78 


8.5 


-0.1 


-0.01 


0.29 


RM472 


<0.0001 


0.09 


qPHv2 (w6) 


2 


118 


40 


-3.6 


0.2 


-0.1 


0.11 


RM5363 


<0.0001 


0.08 


(w8) 




114 


32 


-5.1 


0.3 


-0.1 


0.11 


RM5363 


<0.0001 


0.06 


qPHv3 (w4) 


3 


30 


16 


0.5 


-1.1 


-2.2 


0.06 


RM545 


<0.0001 


0.06 


qPHv6 (w6) 


6 


37 


21 


2.8 


0.3 


0.1 


0.06 


RM204 


<0.0001 


0.04 


(w8) 




36 


21 


4.4 


1.0 


0.2 


0.06 


RM204 


0.0004 


0.03 


qPHv7 (w8) 


7 


1 


12 


-1.7 


1.4 


-0.8 


0.05 


RM481 


0.0012 


0.02 


qPHv8 (w8) 


8 


92 


12 


1.8 


-1.0 


-0.6 


0.04 


RM6765 


0.0013 


0.01 


Plant height- increasing rate, % 






















qPHRl (w6/w4) 


1 


140 


70 


4 


—0.1 


—0.03 


0.27 


RM472 


<0.0001 


0.09 


qPHR2 (w6/w4) 


2 


114 


30 


-2 


0.2 


-0.1 


0.11 


RM5363 


<0.0001 


0.05 


(w8/w6) 




114 


19 


-2 


-1.0 


0.5 


0.06 


RM5363 


0.0228 


0.02 


qPHR6. 1 (w6/w4) 


6 


37 


15 


2 


0.1 


0.1 


0.06 


RM204 


0.0050 


0.02 


(w8/w6) 




37 


16 


2 


-0.3 


-0.2 


0.07 


RM204 


0.0003 


0.03 


qPHR6.2 (w6/w4) 


6 


125 


15 


-11 


1.7 


-0.2 


0.09 


RM5314 


0.0156 


0.02 



QTL, quantitative trait loci; Ch, chromosome; CIM, composite interval mapping; SMA, single marker analysis; ANOVA, analysis of variance. 
Measurements in parenthesis are used to detect the QTL. 

Likelihood ratio (LR) at the peak (cM) position, additive (a) and dominance (d) effects, and proportion of the phenotypic variance explained by the locus (R 2 ) were 
computed by the CIM program based on a subpopulation of 188 (N) F 2 plants. 

The marker nearest the peak was used to confirm the QTL with the whole population of 480-493 (N) F 2 plants by MA. The F-test probability level (P) and R 2 were 
estimated by one-way ANOVA. 



analysis to old cultivars/landraces, including those from aus and U.S. weedy rice populations are often classified based on HC and 

indica ecotypes (Reagon et al 2010), to help identify the founder AN presence/absence in practice. This simple classification system has 
genotypes of U.S. weedy rice. been shown in a good agreement with the classification based on 



Table 6 Summary of haplotypes for the 14 QTL-cluster regions in U.S. weedy rice and wild rice 





Cluster 




Weedy 


rice c 






Wild 


rice c 






Traits (Table 1) associated 


Code 3 Length, Mb fa 


N 


EM93-1 


US1 


others 


N 


EM93-1 


US1 


others 


Share d 


with the cluster 


CL1.1 


5.7 (17.6-23.3) 


3 


0.00 


0.29 


0.71 


6 


0.21 


0.29 


0.50 


3 (100) 


SS, SW, HC 


CL1.2 


3.2 (37.1-40.3) 


6 


0.00 


0.29 


0.71 


10 


0.00 


0.00 


1.00 


1 (17) 


TNR, PHR, SN, PH, SD, FT 


CL7.1 


4.1 (7.0-11.1) 


5 


0.00 


0.36 


0.64 


7 


0.00 


0.00 


1.00 


3 (60) 


SD, PC, FT 


CL5 


7.8 (3.0-10.8) 


10 


0.00 


0.29 


0.71 


10 


0.14 


0.00 


0.86 


1 (10) 


TNR, RTN 


CL4.1 


12.0 (7.9-19.9) 


5 


0.07 


0.29 


0.64 


6 


0.07 


0.43 


0.50 


3 (60) 


AN, SS 


CL8.2 


7.7 (20.2-27.9) 


8 


0.21 


0.21 


0.58 


8 


0.00 


0.00 


1.00 


2(25) 


AN, PH, PHR 


CL9 


7.0 (3.8-10.8) 


7 


0.25 


0.39 


0.36 


6 


0.00 


0.00 


1.00 


2 (29) 


SH, SN 


CL7.2 


8.2 (21.0-29.3) 


6 


0.32 


0.04 


0.64 


8 


0.00 


0.00 


1.00 


3 (60) 


SD, SH 


CL3 


9.3 (0.5-9.8) 


10 


0.43 


0.04 


0.53 


10 


0.00 


0.00 


1.00 


3 (33) 


AN, SS, PHR 


CL2 


10.3 (19.3-29.6) 


8 


0.43 


0.11 


0.46 


8 


0.07 


0.21 


0.71 


3 (38) 


PHR, AN 


CL4.2 


5.0 (21.9-26.9) 


6 


0.54 


0.18 


0.28 


8 


0.00 


0.21 


0.89 


3 (50) 


SD, HC 


CL6.1 


6.1 (3.2-9.3) 


2 


0.57 


0.43 


0.00 


6 


0.21 


0.29 


0.50 


2 (100) 


FT, AN, PHR, LSC, SD, PC 


CL8.1 


3.7 (2.1-5.8) 


6 


0.61 


0.04 


0.35 


8 


0.00 


0.00 


1.00 


2 (40) 


SW, HC 


CL6.2 


5.2 (22.0-27.2) 


3 


0.75 


0.18 


0.07 


4 


0.29 


0.14 


0.57 


3 (100) 


PH, AN, PHR 


Mean 




6.1 


0.30 


0.22 


0.48 


7.5 


0.07 


0.11 


0.82 


2.4 (52) 




QTL, quantitative trait loci; SS, seed setting; SW, see 


d weight; HC, hul 


color; 


TNR, tiller 


number- 


-increasing rate; PHR, plant height— increasing rate; SN, seed 


number 


per plant; PH, plant height; SD, 


seed dormancy; FT, flowering time; RTN, 


number of 


reproductive tillers/plant; AN, awn; 


SH, seed shattering; LSC, leaf sheath 



color; and PC, pericarp color. 

. Refer to Figure 3 for map/chromosomal positions of clusters (CL). 
Physical length (interval) determined by the flanking marker positions (Mb) on the reference genome sequence (Gramene 2012). 

Number of haplotypes (N) and frequencies of the EM93-1- and US1 -like haplotypes, and other haplotypes different from the EM93-1- or US 1 -like in the weedy and 
wild rice samples. 

Number (percentile) of haplotypes shared between the weedy and wild rice samples. 
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< 

X 
CD 

Q- 
=1 

2 
CD 



x 

CO 



D. 

O 

<5 



1.25 



1.00 



0.75 0.50 
Minimum distance between clusters 



0.25 



0.00 



EM93-1 
US1 (BHA) 
R05(BHA) 
R18 (BHA) 
R13 (BHA) 
R16 (BHA) 
R15 (BHA) 

■ R11 (BHA) 
W10(O. nivara) (BHA) 
W08(O.™Vara)(BHA) 
W09 (O. nivara) (BHA) 
R03 (BHA) 

R07 (FHA) 

R02 (BHA) 

R04 (SH) 

R06 (SH) 
' R17 (SH) 

R20 (SH) 

R21 (SH) 
| R22 (SH) 
I R24 (SH) 

R23 (SH) 

R26 (SH) 

R27 (SH) 

R12 (SH) 

R19(SH) 

R14 (SH) 

R08 (SH) 

R25 (SH) 

■ R10 (FHA) 
R09 (FHA) 

I W12 (O. rufipogon) (SHA) 
I W13 (O. rufipogon) (SHA) 
iW04(O. nivara) (BHA) 
I W06(O. nivara) (BHA) 
' W11 (O. rufipogon) (FHA) 
' W05(O. nivara) (BHA) 

■ W14 (O. rufipogon) (FHA) 
R01 (SHA) 

W03(O. nivara) (BHA) 

■ W07(O. nivara) (BHA) 

■ W02 (O. glumipatula) (BHA) 
W01 (O. glumipatula) (BHA) 



Figure 4 Phylogenetic relationship of selected U.S. weedy and wild rice lines. The dendrogram is developed based on the haplotypic data for the 
14 QTL cluster regions (Figure 3). The weedy red (US1 and R01-27) and wild (W01-14) rice lines belong to the four seed morphological types 
(Figure S1): black-hull awned (BHA), furrow-hull awned (FHA), straw-hull awned (SHA), and straw-hull anwless (SH). 



genomic patterns of randomly selected DNA markers (Reagon et al. 
2010) and the haplotype patterns for the QTL cluster regions (Figure 
4). This research identified three HC and six AN QTL and these nine 
loci distribute on nine different genomic segments of six chromo- 
somes (Figure 3). Thurber et al. (2013) detected two HC and three 
AN QTL from U.S. weedy rice, with two of the AN loci on the two 
additional chromosomes. Thus, phenotypic variation for the HC and 
AN morphologies is regulated by at least 1 1 functionally differentiated 
loci scattered on eight chromosomes. The genome distribution pattern 
of the multiple QTL explains the good agreement between the mor- 
phological and marker-based classification systems. It is expected that 
the morphological classification system will continue to be useful 
because of the genome distribution of HC and AN loci and their 
linkage with genes for the other adaptive traits in the clusters. 
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